Google smart speakers are starting to sound like Gemini

PCWorld

A smattering of Google Home users are reporting that their Nest speakers are--when asked the right voice command--chatting with a new voice, a sign that the promised Gemini makeover for Google Assistant is starting to roll out. In a video posted on Reddit, a Google Nest Mini user asked "Hey Google, what's up," and got an unusually loquacious reply in a new voice: "What's happening right now is that we're on a giant rock moving through space at 1,000 miles an hour and orbiting a giant star made up mostly of hydrogen. Also, we're chatting, which I enjoy." When the Nest user asked a more basic follow-up question about the weather, Google Assistant answered in its regular voice with a typical weather report. According to 9to5Google, you can tell if the Gemini-enhanced Assistant has made its way to your Nest speakers by asking, "Hey Google, what's up?"


OpenAI released its advanced voice mode to more people. Here's how to get it.

MIT Technology Review

The update also adds new voices. Shortly after the launch of GPT-4o, OpenAI was criticized for the similarity between the female voice in its demo videos, named Sky, and that of Scarlett Johansson, who played an AI love interest in the movie Her. OpenAI then removed the voice. Now it has launched five new voices, named Arbor, Maple, Sol, Spruce, and Vale, which will be available in both the standard and advanced voice modes. MIT Technology Review has not heard them yet, but OpenAI says they were made using professional voice actors from around the world.


Scarlett Johansson Says OpenAI Ripped Off Her Voice for ChatGPT

WIRED

Last week OpenAI revealed a new conversational interface for ChatGPT with an expressive synthetic voice strikingly similar to that of the AI assistant played by Scarlett Johansson in the sci-fi movie Her--only to suddenly disable the new voice over the weekend. On Monday, Johansson issued a statement claiming to have forced that reversal, after her lawyers demanded OpenAI clarify how the new voice was created. Johansson's statement, relayed to WIRED by her publicist, claims that OpenAI CEO Sam Altman asked her last September to provide ChatGPT's new voice but that she declined. She describes being astounded to see the company demo a new voice for ChatGPT last week that sounded like her anyway. "When I heard the release demo I was shocked, angered, and in disbelief that Mr. Altman would pursue a voice that sounded so eerily similar to mine that my closest friends and news outlets could not tell the difference," the statement reads.


Creating New Voices using Normalizing Flows

Bilinski, Piotr, Merritt, Thomas, Ezzerg, Abdelhamid, Pokora, Kamil, Cygert, Sebastian, Yanagisawa, Kayoko, Barra-Chicote, Roberto, Korzekwa, Daniel

arXiv.org Artificial Intelligence

Creating realistic and natural-sounding synthetic speech remains a major challenge for voice identities unseen during training. As there is growing interest in synthesizing the voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training and create unseen speaker identities. First, we develop an approach for TTS and VC, and then we comprehensively evaluate our methods and baselines in terms of intelligibility, naturalness, speaker similarity, and the ability to create new voices. We use both objective and subjective metrics to benchmark our techniques on two evaluation tasks: zero-shot and new-voice speech synthesis. The goal of the former task is to measure the precision of conversion to an unseen voice; the goal of the latter is to measure the ability to create new voices. Extensive evaluations demonstrate that the proposed approach consistently achieves state-of-the-art performance in zero-shot speech synthesis and creates a variety of new voices unobserved in the training set. We consider this work to be the first attempt to synthesize new voices based on mel-spectrograms and normalizing flows, along with a comprehensive analysis and comparison of the TTS and VC modes.


Ever wanted to hear Donald Trump speaking Hindi? Try the AI tool that can clone anyone's voice

Daily Mail - Science & tech

He has one of the most instantly recognisable voices in Britain, but have you ever wondered what David Attenborough would sound like speaking German? Well, now you can find out, thanks to a new AI tool that can clone anyone's voice and make them say anything in multiple languages. The tool, by ElevenLabs, requires just a few seconds of audio, and even maintains the speaker's original tone of voice. Creators hope this will 'expand the horizons' in numerous fields including publishing, game development and the media. You can try it yourself on ElevenLabs' website using your own voice or that of your favourite celebrity!


New voice cloning AI lets "you" speak multiple languages

#artificialintelligence

This article is an installment of Future Explored, a weekly guide to world-changing technology. You can get stories like this one straight to your inbox every Thursday morning by subscribing here. In January, Microsoft unveiled an AI that can clone a speaker's voice after hearing them talk for just three seconds. While this system, VALL-E, was far from the first voice cloning AI, its accuracy and need for such a small audio sample set a new bar for the tech. Microsoft has now raised that bar again with an update called "VALL-E X," which can clone a voice from a short sample (4 to 10 seconds) and then use it to synthesize speech in a different language, all while preserving the original speaker's voice, emotion, and tone.


Happy International Women's Day!

AIHub

To celebrate International Women's Day, we take a look back over the past year and highlight some of the women we've interviewed, written about, chatted to, and featured on AIhub. Rose Nakasi is a Lecturer of Computer Science and a Research Scientist at the Makerere Artificial Intelligence Lab, in Makerere University, Uganda. She holds a PhD in Computer Science from Makerere University. Her research interests are in artificial intelligence and data science, and particularly in the use of these for developing improved automated tools and techniques for microscopy diagnosis of diseases like malaria in low-resourced but highly endemic settings. We spoke to Rose Nakasi about her work developing machine learning techniques to aid diagnosis of microscopically diagnosed diseases: Interview with Rose Nakasi: using machine learning and smartphones to help diagnose malaria.


AIhub monthly digest: January 2023 – low-resource language projects, Earth's nightlights and a Lanfrica milestone

AIHub

Welcome to our January 2023 monthly digest, where you can catch up with any AIhub stories you may have missed, get the low-down on recent events, and much more. This month, we highlight some of the projects pertaining to low-resource languages, hear about counterfactual explanations for land cover mapping, and find out about machine learning techniques for night-time remote sensing. We are delighted to share the second article in our focus series on "AI around the world": Natural Language Processing for low-resource languages. This time we enter the domain of natural language processing and highlight some of the work and initiatives being carried out on low-resource languages. In our latest episode of New voices in AI, Srija Chakraborty tells us about her work applying machine learning techniques to night-time remote sensing for measuring nightlights from a variety of natural and artificial sources.


This Voice Doesn't Exist - Generative Voice AI

#artificialintelligence

Recently it seems everybody is talking about generative AI. Deep learning-powered large language and text-to-image models like ChatGPT, Stable Diffusion, DALL-E and Midjourney have caused much fuss in the tech world, and beyond. Many include them among the most significant recent developments in AI. Whether or not you agree, the general sentiment seems to be that something very powerful has arrived. In 2023 we'll hear about models that can help you draw or create videos.


A year of new voices in AI

AIHub

What started as a pile of loose post-it-note ideas transformed into nine interviews over the course of 2022. It has been a great privilege to speak to so many great researchers this year; here is a quick summary of all the interviews, covering everything from NLP, to conservation, to swarm robotics. The series began with David Adelani, talking about his work on NLP for low-resource languages. In the second episode, Isabel Cachola talked about how she got into AI and her work on the interpretability of NLP models.